84 research outputs found

    MiMiR - an integrated platform for microarray data sharing, mining and analysis

    Background: Despite considerable efforts within the microarray community for standardising data format, content and description, microarray technologies present major challenges in managing, sharing, analysing and re-using the large amount of data generated locally or internationally. Additionally, it is recognised that inconsistent and low quality experimental annotation in public data repositories significantly compromises the re-use of microarray data for meta-analysis. MiMiR, the Microarray data Mining Resource was designed to tackle some of these limitations and challenges. Here we present new software components and enhancements to the original infrastructure that increase accessibility, utility and opportunities for large scale mining of experimental and clinical data. Results: A user friendly Online Annotation Tool allows researchers to submit detailed experimental information via the web at the time of data generation rather than at the time of publication. This ensures the easy access and high accuracy of meta-data collected. Experiments are programmatically built in the MiMiR database from the submitted information and details are systematically curated and further annotated by a team of trained annotators using a new Curation and Annotation Tool. Clinical information can be annotated and coded with a clinical Data Mapping Tool within an appropriate ethical framework. Users can visualise experimental annotation, assess data quality, download and share data via a web-based experiment browser called MiMiR Online. All requests to access data in MiMiR are routed through a sophisticated middleware security layer thereby allowing secure data access and sharing amongst MiMiR registered users prior to publication. Data in MiMiR can be mined and analysed using the integrated EMAAS open source analysis web portal or via export of data and meta-data into Rosetta Resolver data analysis package. Conclusion: The new MiMiR suite of software enables systematic and effective capture of extensive experimental and clinical information with the highest MIAME score, and secure data sharing prior to publication. MiMiR currently contains more than 150 experiments corresponding to over 3000 hybridisations and supports the Microarray Centre's large microarray user community and two international consortia. The MiMiR flexible and scalable hardware and software architecture enables secure warehousing of thousands of datasets, including clinical studies, from microarray and potentially other -omics technologies

    Determining Frequent Patterns of Copy Number Alterations in Cancer

    Cancer progression is often driven by an accumulation of genetic changes but also accompanied by increasing genomic instability. These processes lead to a complicated landscape of copy number alterations (CNAs) within individual tumors and great diversity across tumor samples. High resolution array-based comparative genomic hybridization (aCGH) is being used to profile CNAs of ever larger tumor collections, and better computational methods for processing these data sets and identifying potential driver CNAs are needed. Typical studies of aCGH data sets take a pipeline approach, starting with segmentation of profiles, calls of gains and losses, and finally determination of frequent CNAs across samples. A drawback of pipelines is that choices at each step may produce different results, and biases are propagated forward. We present a mathematically robust new method that exploits probe-level correlations in aCGH data to discover subsets of samples that display common CNAs. Our algorithm is related to recent work on maximum-margin clustering. It does not require pre-segmentation of the data and also provides grouping of recurrent CNAs into clusters. We tested our approach on a large cohort of glioblastoma aCGH samples from The Cancer Genome Atlas and recovered almost all CNAs reported in the initial study. We also found additional significant CNAs missed by the original analysis but supported by earlier studies, and we identified significant correlations between CNAs

    Integrated Genomic and Gene Expression Profiling Identifies Two Major Genomic Circuits in Urothelial Carcinoma

    Similar to other malignancies, urothelial carcinoma (UC) is characterized by specific recurrent chromosomal aberrations and gene mutations. However, the interconnection between specific genomic alterations, and how patterns of chromosomal alterations adhere to different molecular subgroups of UC, is less clear. We applied tiling resolution array CGH to 146 cases of UC and identified a number of regions harboring recurrent focal genomic amplifications and deletions. Several potential oncogenes were included in the amplified regions, including known oncogenes like E2F3, CCND1, and CCNE1, as well as new candidate genes, such as SETDB1 (1q21), and BCL2L1 (20q11). We next combined genome profiling with global gene expression, gene mutation, and protein expression data and identified two major genomic circuits operating in urothelial carcinoma. The first circuit was characterized by FGFR3 alterations, overexpression of CCND1, and 9q and CDKN2A deletions. The second circuit was defined by E2F3 amplifications and RB1 deletions, as well as gains of 5p, deletions at PTEN and 2q36, 16q, 20q, and elevated CDKN2A levels. TP53/MDM2 alterations were common for advanced tumors within the two circuits. Our data also suggest a possible RAS/RAF circuit. The tumors with worst prognosis showed a gene expression profile that indicated a keratinized phenotype. Taken together, our integrative approach revealed at least two separate networks of genomic alterations linked to the molecular diversity seen in UC, and suggests that these circuits may reflect distinct pathways of tumor development

    Distinct Early Molecular Responses to Mutations Causing vLINCL and JNCL Presage ATP Synthase Subunit C Accumulation in Cerebellar Cells

    Variant late-infantile neuronal ceroid lipofuscinosis (vLINCL), caused by CLN6 mutation, and juvenile neuronal ceroid lipofuscinosis (JNCL), caused by CLN3 mutation, share clinical and pathological features, including lysosomal accumulation of mitochondrial ATP synthase subunit c, but the unrelated CLN6 and CLN3 genes may initiate disease via similar or distinct cellular processes. To gain insight into the NCL pathways, we established murine wild-type and CbCln6nclf/nclf cerebellar cells and compared them to wild-type and CbCln3Δex7/8/Δex7/8 cerebellar cells. CbCln6nclf/nclf cells and CbCln3Δex7/8/Δex7/8 cells both displayed abnormally elongated mitochondria and reduced cellular ATP levels and, as cells aged to confluence, exhibited accumulation of subunit c protein in Lamp 1-positive organelles. However, at sub-confluence, endoplasmic reticulum PDI immunostain was decreased only in CbCln6nclf/nclf cells, while fluid-phase endocytosis and LysoTracker® labeled vesicles were decreased in both CbCln6nclf/nclf and CbCln3Δex7/8/Δex7/8 cells, though only the latter cells exhibited abnormal vesicle subcellular distribution. Furthermore, unbiased gene expression analyses revealed only partial overlap in the cerebellar cell genes and pathways that were altered by the Cln3Δex7/8 and Cln6nclf mutations. Thus, these data support the hypothesis that CLN6 and CLN3 mutations trigger distinct processes that converge on a shared pathway, which is responsible for proper subunit c protein turnover and neuronal cell survival

    Identification of a robust gene signature that predicts breast cancer outcome in independent data sets

    BACKGROUND: Breast cancer is a heterogeneous disease, presenting with a wide range of histologic, clinical, and genetic features. Microarray technology has shown promise in predicting outcome in these patients. METHODS: We profiled 162 breast tumors using expression microarrays to stratify tumors based on gene expression. A subset of 55 tumors with extensive follow-up was used to identify gene sets that predicted outcome. The predictive gene set was further tested in previously published data sets. RESULTS: We used different statistical methods to identify three gene sets associated with disease free survival. A fourth gene set, consisting of 21 genes in common to all three sets, also had the ability to predict patient outcome. To validate the predictive utility of this derived gene set, it was tested in two published data sets from other groups. This gene set resulted in significant separation of patients on the basis of survival in these data sets, correctly predicting outcome in 62–65% of patients. By comparing outcome prediction within subgroups based on ER status, grade, and nodal status, we found that our gene set was most effective in predicting outcome in ER positive and node negative tumors. CONCLUSION: This robust gene selection with extensive validation has identified a predictive gene set that may have clinical utility for outcome prediction in breast cancer patients
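
The step in which a fourth gene set is formed from the genes common to all three sets can be illustrated with a set intersection. This is only a minimal sketch: the function name `common_genes` and the placeholder gene identifiers are assumptions made for illustration, and the abstract does not list the actual genes or describe how the sets were stored.

```python
from typing import Iterable, Set


def common_genes(*gene_sets: Iterable[str]) -> Set[str]:
    """Return the genes present in every one of the supplied gene sets."""
    if not gene_sets:
        return set()
    first, *rest = (set(genes) for genes in gene_sets)
    return first.intersection(*rest)


# Placeholder identifiers (hypothetical, not the genes from the study).
set_a = {"G1", "G2", "G3", "G4"}
set_b = {"G2", "G3", "G5"}
set_c = {"G0", "G2", "G3"}

# Genes shared by all three candidate sets form the derived signature.
signature = common_genes(set_a, set_b, set_c)
```

In the study this kind of overlap yielded 21 genes; the toy sets above are far smaller and serve only to show the operation.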

    Genetic analysis of multifocal superficial urothelial cancers by array-based comparative genomic hybridisation

    The purpose of this study was to investigate the accumulation of genetic alterations during metachronous and/or synchronous development of multifocal low-grade superficial urothelial tumours in the same patient, by using array-based comparative genomic hybridisation (array-CGH) and FGFR mutation analysis. We analysed 24 tumours (pTa-1 G1-2) from five patients. We had previously identified a clonal relationship among the tumours of each patient by microsatellite analysis. This time, unsupervised hierarchical cluster analysis revealed that the tumours from each patient were clustered together independently of the tumours from the other patients. All of the tumours from a single patient showed a set of 2–7 identical regional or whole-arm chromosomal changes. In addition, several individual alterations were also found. Cladistic diagrams revealed that the accumulation of genetic alterations could not be explained by a linear model, and the existence of a hypothetical precursor cell was assumed in four patients. In some cases, FGFR mutation seemed to occur later during multifocal tumour development. Taken together, these findings suggest that low-grade superficial urothelial tumours accumulate minor genetic alterations during multifocal development, although these tumours are genetically stable

    Chromosomal Aberrations in Bladder Cancer: Fresh versus Formalin Fixed Paraffin Embedded Tissue and Targeted FISH versus Wide Microarray-Based CGH Analysis

    Bladder carcinogenesis is believed to follow two alternative pathways driven by the loss of chromosome 9 and the gain of chromosome 7, although other nonrandom copy number alterations (CNAs) have been identified. However, confirmation studies are needed since many aspects of this model remain unclear and considerable heterogeneity among cases has emerged. One of the purposes of this study was to evaluate the performance of a targeted test (UroVysion assay) widely used for the detection of Transitional Cell Carcinoma (TCC) of the bladder, in two different types of material derived from the same tumor. We compared the results of the UroVysion test performed on Freshly Isolated interphasic Nuclei (FIN) and on Formalin Fixed Paraffin Embedded (FFPE) tissues from 22 TCCs and found no substantial differences. A second goal was to assess the concordance between array-CGH profiles and the targeted chromosomal profiles of the UroVysion assay on an additional set of 10 TCCs, in order to evaluate whether UroVysion is an adequately sensitive method for the identification of selected aneuploidies and nonrandom CNAs in TCCs. Our results confirmed the importance of global genomic screening methods, namely array-based CGH, to comprehensively determine the genomic profiles of large series of TCCs. However, this technique still has some limitations, such as being unable to detect low-level mosaicism, or failing to detect a copy number change when high cellular heterogeneity produces a compensatory effect. Thus, it is still advisable to use array-CGH and FISH as complementary techniques: the former detects alterations across the whole genome without excluding any chromosome, whereas the latter preserves data at the level of single cells, even though it focuses on few genomic regions

    Prioritization and Evaluation of Depression Candidate Genes by Combining Multidimensional Data Resources

    Large scale and individual genetic studies have suggested numerous susceptibility genes for depression in the past decade without conclusive results. There is a strong need to review and integrate multi-dimensional data for follow up validation. The present study aimed to apply prioritization procedures to build an evidence-based candidate gene dataset for depression. Depression candidate genes were collected in human and animal studies across various data resources. Each gene was scored according to its magnitude of evidence related to depression and was multiplied by a source-specific weight to form a combined score measure. All genes were evaluated through a prioritization system to obtain an optimal weight matrix to rank their relative importance with depression using the combined scores. The resulting candidate gene list for depression (DEPgenes) was further evaluated by a genome-wide association (GWA) dataset and microarray gene expression in human tissues. A total of 5,055 candidate genes (4,850 genes from human and 387 genes from animal studies, with 182 overlapping) were included from seven data sources. Through the prioritization procedures, we identified 169 DEPgenes, which were highly likely to be associated with depression in the GWA dataset (Wilcoxon rank-sum test, p = 0.00005). Additionally, a higher percentage of DEPgenes than non-DEPgenes were expressed in human brain or nerve related tissues, supporting the neurotransmitter and neuroplasticity theories in depression. With comprehensive data collection and curation and the application of an integrative approach, we successfully generated DEPgenes through an effective gene prioritization system. The prioritized DEPgenes are promising for future biological experiments or replication efforts to discover the underlying molecular mechanisms for depression
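
The weighted scoring described in this abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' prioritization system: the abstract does not say how per-source scores are combined, so summing the weighted scores is an assumption, and the gene names, source names, scores and weights below are all hypothetical.

```python
from typing import Dict, List, Tuple


def combined_score(evidence: Dict[str, float], weights: Dict[str, float]) -> float:
    """Weight each per-source evidence score and add the results.

    Summation across sources is an assumption; the abstract only states that
    each score is multiplied by a source-specific weight.
    """
    return sum(score * weights[source] for source, score in evidence.items())


def rank_genes(
    gene_evidence: Dict[str, Dict[str, float]], weights: Dict[str, float]
) -> List[Tuple[str, float]]:
    """Rank genes by combined score, highest first."""
    scores = {gene: combined_score(ev, weights) for gene, ev in gene_evidence.items()}
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


# Hypothetical evidence scores per gene and data source (illustrative only).
gene_evidence = {
    "GENE_A": {"association": 2.0, "expression": 1.0},
    "GENE_B": {"association": 1.0, "linkage": 2.0},
}
# Hypothetical source-specific weights.
source_weights = {"association": 0.5, "expression": 0.25, "linkage": 0.25}

ranking = rank_genes(gene_evidence, source_weights)

# The candidate-gene total reported in the abstract follows from
# inclusion-exclusion: human + animal - overlap.
total_candidates = 4850 + 387 - 182
```

In the study the weights were not fixed by hand as they are here; they were tuned through the prioritization system to obtain an optimal weight matrix.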

    The endogenous and reactive depression subtypes revisited: integrative animal and human studies implicate multiple distinct molecular mechanisms underlying major depressive disorder

    Traditional diagnoses of major depressive disorder (MDD) suggested that the presence or absence of stress prior to onset results in either 'reactive' or 'endogenous' subtypes of the disorder, respectively. Several lines of research suggest that the biological underpinnings of 'reactive' or 'endogenous' subtypes may also differ, resulting in differential response to treatment. We investigated this hypothesis by comparing the gene-expression profiles of three animal models of 'reactive' and 'endogenous' depression. We then translated these findings to clinical samples using a human post-mortem mRNA study